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INTRODUCTION 



ABSTRACT 

We develop optimised estimators of two sorts of power spectra for fields defined on the sky, in the pres- 
ence of partial sky coverage. The first is the cross-power spectrum of two fields on the sky; the second 
is the skew spectrum of three fields. The cross-power spectrum of the Cosmic Microwave Background 
(CMB) sky with tracers of large-scale-structure is useful as it provides valuable information on cosmo- 
logical parameters. Numerous recent studies have proved the usefulness of cross-correlating CMB sky 
with external data sets, which probes the Integrated Sachs Wolfe Effect (ISW) at large angular scales 
and the Sunyaev Zeldovich (SZ) effect from hot gas in clusters at small angular scales. The skew spec- 
trum, recently introduced by Munshi & Heavens (2009), is an optimised statistic which can be tuned to 
study a particular form of non-gaussianity, such as may arise in the early Universe, but which retains 
information on the nature of non-gaussianity. In this paper we develop the mathematical formalism 
for the skew spectrum of 3 different fields. When applied to the CMB, this allows us to explore the 
contamination of the skew spectrum by secondary sources of CMB fluctuations. Considering the three- 
point function, the study of the bispectrum provides valuable information regarding cross-correlation of 
secondaries with lensing of CMB with much higher significance compared to just the study involving 
CMB sky alone. After developing the analytical model we use them to study specific cases of cosmo- 
logical interest which include cross-correlating CMB with various large scale tracers to probe ISW and 
SZ effects for cross spectral analysis. Next we use the formalism to study the signal-to-noise ratio for 
detection of the weak lensing of the CMB by cross-correlating it with different tracers as well as point 
sources for CMB experiments such as Planck. 
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Meth- 



Observations of cosmic microwave background (CMB) and that of large scale structure carry complementary cosmological information. While all-sky 
CMB observations such as NASA's WMAPQ and ESA's current Planck experiments primarily probes the distribution of matter and radiation at 
redshift z — 1300, large-scale surveys tend to give us a window at lower redshift z ~ 0. The main advantage of cross-correlating such independent data 
sets lies in the fact th at it is possible to high light signals which may not be otherwise detected in individual data sets independently. Earlier studies in 
this direction include, Peiris &Spergell (2000) who carried out a detailed error forecast of such cross-correlation analysis for cosmological parameters. 
Clearly, for these tracers to be effective in constraining cosmology, they should be as numerous as possible to reduce the Poisson noise and the survey 
should cover as large a fraction of the sky as possible to reduce sample variance. 

Various authors have used different external data sets with specific astrophysical tracers to trace the large-scale structure (LSS), with one of the 
main motiv ations being to detect the ISW effect as predicted for ACDM cosmology. Earlier studies in this direction include iFosabala & G aztanaga 
J2004l2006h who cross-correlated the SDSS Data Release 1 galaxies with the first-year full sky WMAP data. Nolta et al. (2003) cross-correlated the 
NVSS radio source catalogue with first-year full sky WMAP data. Scranton et al. (2003) who correlated Sloan Digital Survey against WMAP data. 
iBoughn & Crittenderi ( 2005 , 2004al lbT) used two tracers of the large-scale structure: the HEAOl A2 full sky hard X-ray map and NVSS full sky radio 
galaxy survey. A maximum likelihood fit to both data sets yields a detection of an ISW amplitude at a level consistent with what is predicted by 
the A CDM cosmology. Most of the se studies detec ted ISW effect at a level of 2-3 a although the error ana l ysis m odels and the statistic used were 
sometimes completely different (see lHo et. alj d2008l) for tomographic studies involving ISW and lHirata et al.l d2008l) for weak lensing detection). The 
ISW effect remains one of the most direct and quantitative measure of the dark energy available to us today. Future all-sky missions such as Planck 



1 http://map.gsfc.nasa.gov/ 
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will provide an excellent possibility to extend these studies to higher confidence regime. While the above studies are mainly focussed on large angular 
scales, where the ISW effect plays an important role, at small angular scales, the presence o f clusters and probably the a ssociated filamentary network 
in which they reside can also affect both CMB maps through the Sunyaev-Zeldovich effect ( Sun vaev & Zeldovich 1980) as well as through the X-ray 
ma ps via bremsstrahlung. Cross-correlation analysis o f the diffuse soft X-ray background maps of ROSAT with WMAP 1st year data were performed 
by toieeo. Silk. Sii w^Eool Ipie ^o.Silk. Silwal Eoblh. This study was motivated by the fact that hot gas in clusters can be more easily detected by 
cross-correlating X-ray and CMB maps. Although no evidence was found of this effect it opens the possibility of detecting such an effect in future 
high-resolution CMB maps. All these act as a motivation for development of a generic techniques to cross-correlate high-resolution CMB maps with 
other maps from LSS surveys. In this paper we focus on cross-correlating two or three different datasets, but the challenges are similar to those arising 
from a single dataset. For example, the estimation of the power spectrum from a single high-resolution map poses a formidable numerical problem 
in terms of computational requirements. Typically two different methods are followed. The first os the non-linear maximum likelihood met hod, or its 
quadr atic variant, which can be applied to smoothed degraded maps, as it is no t possible to direc tly invert a full pixel covariance matrix dTegmark] 
1997). To circ umvent this problem a p seudo-Ci s (PCL) technique was invented dHivon et alj|2002l) which is unbiased though remains suboptimal. In 
recent analysis lEfstathioul d200l,[2006) has shown how to optimize these estimators which can then be used to analyse high-resolution maps in a very 
fast and accurate way. We generalize the PCL-based approach here to compute the cross-correlation of different data sets. The method developed here 
is completely general and can be applied to an arbitrary number of data sets. For example, our formalism can analyse the degree of cross-correlation 
among various CMB surveys observing the same region of the sky with different noise levels and survey strategies. 

For near-Gaussian fields, two-point analysis from any cosmological survey provides the bulk of the cosmological information. Nevertheless, going 
one step f urther, at the level o f three- point correlation, the detection of departure from Gaussianity in the CMB can probe both primary non-Gaussianity 
sec (e.g. iMunshi & Hea^nl §009)) as well as the mode-coupling effects due to se condaries. The possibili t y of further improving a detection o f 
primordial non-Gaussianity with CMB maps, given current hints with WMAP data dYadav & Wandeltll2008l ; Smit h. Senatore & Zaldarriagall2009T) . 
provides further motivation in this direction. One of the prominent contributions to the secondary non-Gaussianity is the coupling of weak lensing 
and sources of secondary contributions such as SZ ('Gold berg & Sper gel 1999; Cooray & Hull2000l) . Although weak lensing produces a characteristic 
signature in the CMB angular power spectrum, its detection has proved to be difficult internally from CMB power spectrum alone. The non-Gaussianity 
imprinted by lensing into the primordial CMB remains below the detection level of current experiments, although with Planck the situation is likely 
to improve. The difficulty originates mainly due to the fact that such detections are linked to the four-point statistics of the lensing potential. However 
cross-correlating CMB data with external tracers means lensing signals can be probed at the level of the mixed bispectrum. After the first unsuccessful 
attempt to cross-correlate WMAP against SDSS, recent efforts by Smi th. Zahn & Dord d2007l) have found a clear signal of weak lensing of the CMB, by 
cross-correlating WMAP against NVSS which covers a significant fraction of the sky. Their work also underlines the link between three-point statistics 
estimators and the estimators for weak lensing effects on CMB. 

The study of non-Gaussianity is primarily focused on the bispectrum dHeavens|[l998l) . however in practice it is difficult to probe the entire 
configuration dependence in the harmonic space from noisy data. The cumulant c orrelators are multi - point correlators collapsed to probe two-point 
statistic. These were introduced in the con text of analyzing galaxy cluste ring by [Szapudi & Szalayj dl999h . and were later found to be useful for 
analyzing projected surveys such as APM dMunshi. Melott & Colesl 2000h. Bein g two-point statistics they can be analyzed in the multipole space 
by defining an associated power-spectrum. Recent studies bv lCoorav d2006h and Cooray. Li & Melchiorril d2008l) have shown its wider applicability 
including e.g. in 21cm studies. However, the multi-spectrum elements defined in multipole space are difficult to estimate directly from the data because 
of their complicated response to partial sky coverage and inhomogeneous noise, as well as associated high redundancy in the information content. 
However such issues are well understood in the context of power-spectrum analysis. Borrowing from previous results, in this paper we show how the 
cross-power spectrum and the skew spectrum can be studied in real data in an optimal way. We concentrate on two effects: the cross-correlation power 
spectrum, which is recovered by cross-correlating two different (but possibly correlated) data sets, focussing on weak lensing effects on the CMB, and 
secondly the contributions to the skew spectrum from foreground effects. The relation of such cross-power spectrum estimators with higher-order multi- 
spectra such as the bispectrum is also discussed in the context of methods known as pseudo-C; s and quadratic estimators. We derive the error-covariance 
matrices and discuss their validity in the signal- and noise-dominated regimes and comment on their relationship to the Fisher matrix. 

This layout of the paper is as follows: in §2 we use the formalism based on Pseudo-C^ analysis for power-spectra to study the cross-correlation 
power spectrum of different data sets. While we keep the analysis completely general, it is specialized for the case of near all-sky analysis and use 
it to compute the signal-to-noise and the covariance of estimated Cis for various tracers with Planck-type all-sky experiments. Possibilities of using 
various weights which can make the pseudo-Cf approach near optimal in limiting cases of the signal-dominated regime or the noise-dominated regime 
are also discussed. In §3 we continue our discussion on Pseudo-Cf but generalize it to the analysis of the skew spectrum. Such an estimator can handle 
the partial sky coverage and noise in a very straightforward way. However in general it remains sub-optimal. In the high-? regime where mode-mode 
coupling can be modelled using a fraction of sky f s k y proxy one can make such an estimator nearly optimum using a suitable weighting. After a very 
brief introduction to various physical effects in §4 which introduce mode-mode coupling that leads to CMB bispectra, we move on to develop a crude 
but fast estimator for the skew spectrum in §5. Section §6 is devoted to developing the mixed bispectrum analysis in an optimal way by introducing 
inverse covariance weighting of the data. We analyze both one-point and two-point collapsed bispectral analysis. The one-point estimator or the mixed 
skewness is introduced - being a one point estimator it compresses all available information in a bispectrum to a single number. Next, we introduce 
the mixed skew spectrum which compresses various components of a bispectrum to a power spectrum in an optimum way. Next, in §7, the general 
formalism of bispectral analysis is used for specific cases of interest. 



2 THE PSEUDO-C^ ESTIMATOR FOR CROSS-CORRELATION ANALYSIS 

In this section we generalise results from pseudo-Cf power spectrum estimation of a single field with partial sky coverage, to cross-power spectra of 
two fields. For two fields $ x , <E> y defined on the sky, the pseudo-Cf estimators are constructed from the spherical harmonic transforms a lt ^ over the 
partial sky, where the fields are assumed to take zero value in unobserved regions. 
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Figure 1. The power-spectra C;s are plotted for various analysis. In the left panel we show the power spectrum corresponding to a LSS tracer such as NVSS and that of 
CMB ISW. The power spectra associated with the two is also depicted. In the right panel we show SZ and ISW power-spectrum and their cross-correlation. The CMB 
power spectrum is also plotted in both panels. SZ curves include only the part that is correlated with the large-scale density field. 



The are related to the true all-sky spherical harmonics af^ by a linear transformation, via the transformation matrix Ki m i' m ', and where 
possible direct inversion to get aim is much faster than maximum-likelihood analysis. Using a suitable choice of weighting function for the data this 
estimation can also be made nearly optimal. 

2.1 Estimator for Cf' Y 

The transformation from the pixel-space to harmonic domain can be expressed as follows: 



~X,Y 

a,' 



J2 ^ X ' Y ^iM^) X ' Y ^i) X ' Y yirn(^i) = ^a^,KfJ m ,. (1) 



pixelsi 



Here w(&i) denotes the pixel space weight, Yi m (Cli) represents the spherical harmonic and Q p (Cli) is the pixel area (which we will assume independent 
of the pixel position). Expanding the weighting function in a spherical basis one can write down the coupling matrix Ki imi i 2m2 which encodes all 
information regarding mode-mode coupling due to partial sky coverage as (see e.g. lHivon et alj f2002h for detailed derivation): 

K, / w(fl)Yl 1 m 1 (&)Yl 2 m 2 (f2)dn 

- ( (2h + l)(2l 2 + l)(2l 3 + l) \ 1/2 ( k h h\ ( h h h \ 

w h™ 3 7Z n n n ™ j ( 2 ) 



^ 13,713 I 4tt J \ ) I mi m 2 m 3 

where wi m is the transform of the window or (arbitrary) weighting function. The matrices represents 3J functions. The quantum numbers I and m 
need to satisfy certain conditions for the 3 J functions to have non-vanishing values. The pixel area of maps <1> X and <E> y will be denoted by and the 
associated weights with each pixels will be left arbitrary w i ' . Adopting the notation of Efstathiou (2004), we write the pseudo-C £ ' ' in terms of 
the underlying true power-spectrum Cf' Y ' XY s. 
Defining the following: 

(C?) = ( 2 £+ l) ^2 l 5 ^! 2 ' {Cf y ) = ( 2£ X + ^ ^2 Real{af m * aj m ) (3) 

m m 

and similarly for (Cf), we have 

(C?)=J2 C ?' M U'> (4) 

i 

where a — X, Y or XY, and we can estimate the true covariance matrices as 

<9? = (M-^qg. (5) 

where the matrix M can be expressed in terms of 3 J symbols as dHivon et alj|2002l) : 

- + 1) E ^|^^ ( t l l l o ) 2 (6) 

and Wi — jjq-j- ^ m l w ( m 1 2 is the power spectra associated with the mask. Note that the transformation matrices M X ' Y depends on the power- 
spectrum of the weighting function u>^ Y , whereas the matrix M XY for cross-power spectra is determined by the cross-power spectra W XY of two 
weighting functions. Independent of the choice of weights the estimators C" remain unbiased. 
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Figure 2. The variance for estimated C'i s are plotted. The variance for estimated C; corresponding to ISW, SZ and their cross-correlation are plotted in the left panel. The 
ISW vs local tracers (NVSS type) analysis is plotted in the right. The number density of galaxies for NVSS-type survey was taken to be TV = 7 X 10 s per Steradian. For 
CMB a Planck type experiment was assumed. Results are for all-sky surveys. Results plotted are for f s ky = 1. For near all sky survey, the variances will scale linearly 
with f sky . 

2.2 Covariances of Pseudo-C^s 



The pseudo-Cf s are unbiased. The variances of these estimators can be computed analytically for arbitrary sky coverage and a non-uniform Gaussian 
noise distribution. The deviation of the estimated Ci from the ensemble average (Ci) is denoted by SCi. 



8C? = C?-(C?).; aeX,Y,XY 

We are concerned here with the computation of the covariance of estimated Cis. We begin by defining the covariance matrix: 

(SCtStf) = (C?C?) - (Cr)(Cf,); a,f3eX,Y,XY 

The covariance of Cis from individual surveys $ x or $ Y can be expressed as follows jEfstathiouliooi) . 



(V) 
(8) 



\wlm\ \w lm \ +Ci 

L M 



2L 



1 V"^ I X |2i/ 2\Y i2 

~p[ \ Wlm \ \{ wa > LM \ 



1 V"^ I X i2i/ 2\Y 1 2 "I 

' 2 L + 1 ZJ™ iM \( wa > LM \ ) 



L £ £' 




(9) 



Extending the above results similarly the covariance of Cf for the cross-power-spectrum can be expressed as: 



{^ x d Y 



2L 



1 \ I X i2i Y |2 , n X 

~j— j"Z^' WLM ' \ w lm\ +C e 



2L 



1 \ I X i2i/ 2 2sX |2 
— 2^ \ w lm\ \{W a ) LM \ 



+Ci Y 2L \ 1 ^2 \ w lm\ 2 \(w 2 (t 2 )Im\ 2 + 2L + 1 ^2 \( w2a2 )LM\ 2 \(w 2 a 2 )l M \ 2 ^ 

M M 

In our derivation we have assumed that all three power spectra are being estimated from the data simultaneously. 
The three off-diagonal terms can be similarly expressed as: 

2 



Lit 




2L + 1 
2tt 



Lit 




(10) 
(11) 



(12) 



L M 

Here we have introduced following notations: 



X Y * II X I 

w LM w LM \\w LM \ 



2 . n XY 1 \ "~ / 2 2-.X Y* :\ 

2L + l 2-^^ W a > lmWlm \) 



( 2L + 1 
V 2tt 



L £ I' 




(13) 



Whn = / dQw x (Q)Yi m (Q) 



(wa 2 ) x m = / d(lw x (Q,)a 2 x {yi)Yi m {Cl) 



, 2 2\X 
{ui a )lm 



dnw x (n) 2 a 2 x (n)Y lm (n). 



(14) 
(15) 
(16) 
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Figure 3. The cumulative signal to noise for ISW cross LSS (right panel) and SZ cross ISW (left panel) are plotted as a function of I. The results are obtained by using the 
covariances of C;s plotted in previous plot. The S/N will degrade linearly with f aky . Plots correspond to f a k y = 1. 



Similar expression hold for the second survey and the cross terms for product of two surveys are also needed to derive the error covariance matrices. 
Finally the error covariances associated with deconvolved estimators Ci can be expressed in terms of that of the convolved estimators C\ as follows: 

( (SOftC*,) {5Ci8Gl)\_( M£* Ml? y 1 ( (5C?5C*) QC? 8C$) \ ( Uft Mf Y \ lT 

{SCISC*} (SClSGl) J { M Y L ? M Y J ) \ (SCT8C?) (SC Y 5C?) ) { Mtf M YY J (17) 

In deriving these results it is assumed that the coverage of the sky is near complete. This will mean that the windows associated with the various 
couplings are sharper than any features in the power spectra. The shape of the mask and the noise covariance properties are quite general at this stage. If 
the C;s of individual data sets are known from independent estimations then cross spectra deconvolution of the cross-spectra C?' x can be simply 
written as: 

(C X L Y C X L ; Y ) = ^[M^ y ]-(A X,r ^' y )[M Y ]^. (18) 

IV 

In the limiting situation when the survey area covers almost the entire sky these equation takes a much simpler form which are in common use in the 
literature. If f s k y is the fraction of the sky covered then one can write: 

(SCf Y 5C{f Y ) = f sky ^±^[cfcr + (C? Y ) 2 ]5w, {5Cf Y SC? Y ) = f sky ^-^(C? Y C? Y )6 W ■ (19) 

The Cjs in these expression are the total Ci = Cf + Cf which takes contribution from both signal and noise C;s. 



3 THE PSEUDO-C^ ESTIMATOR FOR MIXED BISPECTRUM ANALYSIS 

The statistics of temperature fluctuations in the sky are very nearly Gaussian, but small departures from Gaussianity can put constraints on early universe 
scenarios. Secondary non-Gaussianity on the other hand can provide valuable information to distinguish structure formation scenarios, and when used 
with constraints from the power spectr um it can be a very valua ble tool. However estimation of the bispectrum for each triplet of harmonics modes 
can be difficult to perform numerically. iMunshi & Heavens] d2009l) introduced a limited data compression method for 3-point functions which reduces 
the data to a single function (the skew spectrum), and which can be made optimal for estimating a bispectrum form. In the same spirit, we define a 
pseudo-skew spectrum for three arbitrary fields defined on a cut sky, and show how it is related to the skew spectrum on the uncut sky. The PCL-based 
approach described here is not optimal, however, it can be made optimal with suitable choice of weights. 

Let us assume that we have three fields which are defined over the observed sky. The product of two of these fields as X(fi)Y(0) has an associated 
mask which, we denote as wa(£1), and which is a product of two masks associated with the individual fields. Analogously, the third field Z(fi) is 
observed with a mask Wb(£1)- 

From the harmonic transforms of these fields we study the skew spectrum and express it in terms of the mixed bispectra of fields X, Y and Z, 
^hhh • ^ e develop this generally, but the results we derive will be useful for the study of primordial non-Gaussianity. Here we consider a single field 
(the CMB), but it is a field with contributions from various components, and the skew spectrum contains terms from various triplet of different (or 
repeated) fields. 



3.1 Estimator for Cf Y ' z 

3.1.1 All-Sky Analysis 

We start by introducing the power spectrum C ; XY ' Z associated with the cross-correlation of the product map X(f2)Y(f2) and Z(f2). In the absence of 
sky-cuts and instrumental noise we can write: 
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/>XY,Z 1 \ ^ XY Z* 

1 = 21 + 1 ' J aim aim 

m 

where af^ is the spherical harmonic transform of XY. 

Assuming homogeneity and isotropy, the correlation function C(Sl, Q') of X(Q) Y(Q) and Z(Sl) can be written in terms of C ; XY,Z 

c(n,n') = (x(n)Y(n)z(n')>= (^mXmJiWW^w^') = J-^(2/ + i)p i (cos(n.n'))C' ! XY ' z ( 21 ) 

l\ra\,l 2 ra 2 ^ 

Here P; (fi • ft') is a Legendre polynomial of order Z. The three-point correlation function in the harmonic domain can similarly be used to introduce 
the mixed bispectrum B t J[3 for the related fields. Assuming statistical isotropy, 

/ x y z v _ ( h h h \ d xyz 

\ a l 1 m 1 a l 2 m 2 a l 3 m 3 ) — I „, „ I &l\l 2 l 3 - \ LL ) 

Our aim is to compute the cross-correlation power spectra of the product field ~X.(Q,)Y(Q) and Z(Q). Using a harmonic decomposition we can relate 
the multipoles af^ with multipoles a x m / and a V i m ii : 

ail = y dfty,^(ft)x(ft)Y(ft) = X) a ^'°^™" / ./i>>-..,.fi>i>- ; -,./ii>i>;-„.-ii.)i 

Z'm' I" m" 



x Y / (2ii + i)(2k + i)(2k + i) / z /' i" \ i i' v \ , ,s 

^^a iW a iw ,W , „ . (23) 



47T \ / \ m m m 

Contr acting with the multipole of the remaining field af, m , we can see that it directly probes the mixed bispectrum associated with these three different 



fields (Cooray 2001a). 



/ XY Z* v _ p XY,Z <: c r XY,Z _ \ ^ R XYZ /(2/i + l)(2Z 2 + l) / h h h \ mAS 

{a lm a Vm ,)=C l d w d mm >; C ; = 2^ B Uii 2 \J 47r(2 ^ + fj ^ Oj ( ) 

Since the bispectrum is determined by triangular configuration in the multipole space (Zi, 1%, I), the power spectrum Ci defined above captures infor- 
mation about all possible triangular configuration when one of its sides is fixed at length I. However this data compression is not done optimally as it 
does not weight the contributions from each bispectrum components with their inverse variance. The error covariance matrix can be computed exactly 
and depends on higher-order moments of signal and noise, as well as their cross-correlations. 



3.1.2 Partial sky Coverage 

It is possible to extend the above result to take into account partial sky coverage. Assuming the composite map X(Cl)Y(Q) is masked with arbitrary 
mask liu(fi) and the map Z(Q) is masked with tu_e(f2) we can write the cut-sky multipoles fi^J and af m in terms of their all sky counterparts as well 
as the multipoles associated with the mask multipoles follows: 

= / x(n)y(nwn)Y^(n)dn ; <>?,.,= I zai)w D ai)Y,uu),ni 



J X(fi)y(n)w A («)Y4(fi)d«; af m = J Z{(l)w B {h)YC m {(l)dn 

afm = X/ X/ X/ a hmi a hm2 W lAmA / dQ,Yi imi (£l)Yi 2m2 (Q)Yi AmA (£l) 

i — i i . — . " 



l^mi l 2 m 2 I A m A 



afm = ^ ^ af 3m3 i"iBmB J d(lYi 3m3 {tl)Yi BmB {(l). (26) 



'3 m 3 ls m B 



Here S( 1 ; 2 ( 3 is the angle-averaged bispectrum and the functions bi 1 represent the effects of pixellisation as well as beam smoothing. For partial sky 
coverage one can obtain after tedious but straight forward algebra: 



- XY , Z ^ 21' + 1 ^ (21" + 1) ( I V l"V, |2V- r xyz / (2Fi + l)(2fa + l) f h h V \_ M axy.z nl . 



This is one of the important results in this paper. It shows how the pseudo-skew spectrum is related to the all-sky skew spectrum, and is a 
computationally-efficient way to estimate the latter. By suitable choice of weight functions it can be made optimal. It is valid for a completely general 
mask where wi represents the spherical transform of the mask. 

The transformation matrix M u i used here is the same as that we introduced for the recovery of cross power spectra. The power spectra associated 
with the mask wi, plays the same role in construction of M u i For simplicity we have assumed that different data sets have the same mask but it 
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is trivial to generalise for two different masks. A detailed compar ison of level of sub-opti mality will be compared with numerical simulations in an 
accompanying paper. This analysis is complementary to work by|ch en & Szapudi (2006) where a similar sub-optimal estimator was used to study 
non-Gaussianity. 



4 CMB SECONDARY BISPECTRUM 

The formalism developed so far is quite general and can handle mixed bispectra of different kinds. The main goal was to relate the skew spec- 
trum with the corre s pondi n g mixed bispectrum. To m ake c oncrete predictions w e need to consider a specific form for the bispectrum. Following 
Spergel & Goldberg ( 1999), Goldberg & SpergeJ i 1999b and lCoorav & Hul J2000h we expand the observed temperature anisotropy <5T(fi) in terms of 
the primary anisotropy STp, and due to lensing of primary, <5TL, and the other secondaries from coupling large-scale structure, 8Ts. 

ST(Q) = <5T P (0) + <5T L (fi) + 5T S (Q). (28) 

Expanding the respective terms in spherical harmonics we can write: 

STp(Q) = J2a lm Y lm {Cl); ST h (£l) EE ^ a, m V9(0) • VT B (ti); 5T 3 (fl) = ^b lm Y lm (Q). (29) 

Im Im Im 

The harmonic coefficients bi m are associated with the expansion of the secondary anisotropics 5Ts(Cl). The secondary bispectrum for the CMB then 
takes contributions from many products of P, L, S terms. For example, one term arises from products of STpST^STs: 

BfJX ee V [ h h h ) / («5rp(^ 1 )5T L (n 2 )«5r s (n3))^ 1 m 1 («i)yi 2m2 (fi2)F ;3m 3(n3)dnidn2dn3 

' ^ V ^1 TTt<2 ^3 I J 

= E ( m\ m 2 m 3 ) ((^ W^W^ W- (30) 
It is possible to invert the relation using isotropy of the background cosmology: 

{{STp)i imi (STp)i 2 m 2 (STp)i 3 m 3 ) = ( „i 2 l J[ ) B h i 2h . (31) 



mi m 2 rrii 

Explicit calculations, detailed in iGoldberg & Spergell d 19991) and lCoorav &~Hu1 [2000), found the mixed bispectrum to be of the following form: 



B hhh = ~ V>h c h ' ~' " 1 '° ± 2 ^' "° V " J ' ^' + cyc.perm.j ^ v "" 1 ' ^'""^ — I q q q J = b hh i 3 I h i 2 i 3 , (32) 

where we have defined the reduced bispectra bi 1 i 2 i 3 , which is useful in certain context dBartolo. M atarrese & Riottoll2006h . and the additional geomet- 
rical factor, which originates from the integral involving three spherical harmonics over the entire sky: 



h{h + l)-h{h + l)-h{h + l) , 1 /(2ix + l)(2i2 + l)(2J3 + l) f h h h 



(2/i + l)(2Z 2 + l)(2« 3 + l) ( h h h 



fi ^ 5 V 4^ \Q I' 

The cros s-correlation power-s pectra appearing in the above expression denotes the coupling of lensing with a specific form of secondary non-anisotropy 
(see e.g. lCoorav & Hul d2000h ). 

(Ql'm'blm) =blS W S mm i. (34) 

The bispectrum contains all the information at the three-point correlation function level and can be reduced to one-point skewness or the two-point 
collapsed correlation function or the associated power-spectra, the skew spectrum. Here we have considered the secondaries of the CMB, however the 
analysis holds if external tracers such as the radio galaxy surveys such as NVSS or 21cm observations are used instead. In the next few subsections we 
will discuss the problem of estimation of the skew spectrum in a nearly optimal way. This will lead to a discussion of development of optimal techniques 
in subsequent sections. We will also tackle the problem of joint estimation of several bispectra and associated estimation errors. As is known and will 
be discussed in the followin g sections that the estimation of CMB bispectra is similar and related to the case of lensing reconstruction of the CMB sky 
JSmifh. Zahn & Dorell2007l) . 



5 ESTIMATION OF SKEW SPECTRA 

The problem of estimation of the skew spectrum is very similar to that of the primary CMB bispect rum. There has been a re c ent surge in activit y 
in this area, driven by the claim o f detection of non-gaussianity in the WMAP data release (see e.g lYadav & Wandeltl J2008h : lYadav et aT] {2008); 
lYadav, Komatsu & Wandeitl J2007t) ). Different techniques were developed which introduce various weighting schemes in the harmonic domain to make 
the method optimal (i.e. saturates the Cramer-Rao bound). Maps are constructed by weighting the observed CMB sky with /-dependent weights obtained 
from inflationary theoretical models. These weighted maps are then used to compute one-point quantities which are generalisation of skewness and 
can be termed mixed skewness. These mixed skewness measures are useful estimators of /jvl parameters. A more general treatment was provided 
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in lSm ith & Zaldarriagl] d2006l) :[Smith. Zahn & Dore (2007i): ISmith. Senatore & Zaldarriagj j2009l) who took into account mode-mode coupling in an 
exact way with the u se of proper inverse-covari ance weighting of harmonic modes. 

Recent work by Munshi & Heavens (2009) has improved the situation by focussing directly on the skew spectrum. Their technique does not com- 
press all the available information in the bispectrum into a single number but provides a power-spectrum which depends on the harmonic wavenumber 
I. This method has the advantage of being able to separate various contributions as they will have different dependence on is, thus allowing an as- 
sessment of whether any non-gaussianity is primordial or not. In this section we compute the contaminating secondary bispectrum contributions from 
lensing-secondary coupling. 



5.1 Bispectra without line of sight integration involving ISW-lensing RS-lensing and SZ-lensing 

The study of the bispectrum related to secondary anisotropy (see ICoorav & Setr] feOOd) for more details and analytical modelling based on halo 
model which we use here) is ar guably as important as that generated by the primary anisotropy. Primary non-Gaussiani t y in simpler infl ation- 



model which we use here) is arguably as important as that generated by the primary anisotropy. Primary non-Lraussianity in simpler innat 
ary models is vanishingly small Isiriopek & Bond 1 19901 . Il99ll : iFalk et all [l993l : iGangui et al] 1 19941 : lAcquaviva etall 120031 : lMaldacenall2003l) : 



iBartol o. Matarrese & Riotto (2006) and references therein for m ore details. However, variants of simple inflationary models such as multiple scalar 
fields dLinde & Mukhanovlll997l:lLvfh. Ungarelli & Wandsll2003l), features in the inflationary potential, non-adiabatic fluctuations, non-standard kinetic 



terms, warm inflation (Gupta, B erera & Hea vens 20021: iMoss & X iong 2007), or deviations from Bunch-Davies vacuum can all lead to a much higher 

level of non-Gaussianity. 

Early observational work on the bispectrum from COBE l Komatsu et alJI 20021) and MAXI MA dSantos et alj|2003l) was followed by much more 



accurate analysis with WMAP dKomatsu etal]|2003l : ICreminelli et al.ll2007l : ISpergel et al.l l2007). The primary bispectrum encodes information about 
inflationary dynamics and hence can constrain various inflationary scenarios, where as the secondary bispectrum will provide valuable information 
regarding the low-redshift universe and constrain structure formation scenarios. These bispectra are generated because of the cross-correlation effect of 
lensing due to various intervening materials and the secondary anisotropy such as the Sunyaev-Zeldovich effect due to inverse Compton scattering of 
CMB photons from hot gas in intervening clusters. 

b hhh = -- [ {(h(h + 1) - k(h + 1) - Hh + l^Cf^a + cyc.perm. ] . (35) 

The power spectrum Cf is the unlensed power spectrum of the CMB anisotropy. We have introduced the subscript S to distinguish it from the C;s that 
appear in the denominator which take contribution from the instrumental noise from signal to noise computation point of view (Ci — Ci + N/b(l) 2 , 
where TV is the instrumental noise and b(l) is the beam function in multipole space). We define the different fields which are constructed from underlying 
harmonics and corresponding C;s. These will be useful for constructing an unbiased near optimal estimator. 



A W _ 

A lm — 


a lm r S 


Im 


= 1(1+1) 


aim 

~eT ; 


r {i) _ aim, 
Cl 


A lm — 


J(J + l)^Cf; 




R (2) = 

Im 


aim 


„(2) _ aim. 


A lm — 


aim r s 

~P> w ; 


R (3) 
B lm 


dim 




C&> =1(1 + 1)^6 



(36) 

The corresponding fields that we construct are (fa) = Yi m (fa)A^ l , and in an analogous manner and C' 1 ' . The optimised skew spectrum 
in the presence of all-sky coverage and homogeneous noise can now be written as: 

Cf' 1 = ^ T ^^Real{(A (!) (n)B (l) (n)) im C (l) *(A) im } + cyc.perm. (37) 

m i 

The cyclic terms that are considered here will have to constructed likewise from the corresponding terms in the expression for the reduced bispectrum 
discussed above[35] The linear-order correction terms which needs to be included in the absence of spherical symmetry due to presence of cuts to avoid 
the galactic foreground and the inhomogeneous noise can be written as: 

£72,1 _ 1 [gAB,C _ gA(B,C) _ qB(A,C) _ g(AB),C 

fskv ^ L 1 



(i) 

+ cyc.perm. (38) 



The terms without averaging such as Cf B,C are direct estimates from the observed partial sky with inhomogeneous noise. The Monte Carlo corrections 
such as (7 ; A ( S ' C ) are constructed by cross-correlating the product of the observed map A and a Monte Carlo map B with a Monte Carlo map C and 
then taking an ensemble average over many realisations. The denominator f s k y , which represent the fraction of the sky covered, is introduced to correct 
for the effect of partial sky coverage. This is an approximate way to treat the mode-mode coupling due to partial sky coverage and known to be a good 
approximation for higher I. The skewness associated with this form of bispectra can be expressed as a weighted sum of the corresponding C'is: 

s=Y^w +1 W 1= Y, *M*S; h - (39) 



ihh 



Constructing such weighted maps clearly can be seen as a way to construct a matched-filter estimator for the detection of non-Gaussianity. It is optimally 
weighted by the inverse cosmic variance and achieves maximum response when the observed non-Gaussianity matches with a specific theoretical input. 
The skew spectrum also allows for analysis of more than one specific type of non-gaussianity from the same data - allowing a joint analysis to determine 
cross-contamination from various contributions. 
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5.2 Bispectra involving line of sight integration: The Ostriker-Vishniac effect and its correlation with other secondary anisotropics 

Another set of se condary bispec tra involving any of the Ostriker-Vishniac (e.g. see IJaffe & Kamionkows kj]2004b)) effect. SZ thermal effect, or the 
kinetic SZ effect (Coorav 2001b) or a combination of these have the following form of reduced bispectrum dCoorav & HukOOOt) . which involves a line 
of sight integration along r: 

b hhh = J dr f h {r)gi 2 (r) + cyc.perm. (40) 

The construction of weighted maps follow the same principle with the use of kernels fi(r) and gi(r) that are associated with any of the scattering 
secondaries that involve a line of sight integration. Numerical implementation of line of sight will naturally have to deal with an optimal method to 
include the quadrature. Defining 

>Wr) = ^/,(r); B lm (r) = ^gi(r); C lm {r) = ^. (41) 

tl Li Li 

A and B are fields constructed from the generic function represented by /; and gi. Following lMunshi & Heavensi j2009h we construct 

c ~ ;2 ' 1(r) " 2TTI / drr ^ Real{(AB)im ^' r)c( ^' r)ira} + cyc - perm - (42) 

m 

and from this compute the skew-spectrum: 

C?' 1 = J drCf\r). (43) 

This is the generalisation of the all-sky estimator of the skew spectrum of lMunshi & Heavensi d2009t) . but for three distinct fields. 
The corresponding one-point skewness can be written as 

S = J> + l)cr = £ B 'rfr lt2 - (44) 

I llil 2 



In the next section we consider the contamination of the primary skew spectrum by secondary non-Gaussianity from point sources. As before we have 

anic coeeficients of the 
theoretical CMB powerspectra Cf , i.e. Ci = Cf + Cf 



absorbed the beam in the harmonic coeeficients of the data vector a; m . As before, C;s also take contribution from the noise Cj as well as from the 



5.3 General Expression 

From the examples above, its clear that from a very general consideration if the reduced bispectrum can be decomposed in such a way it consists of 
terms, which can be used to construct fields such as , B^ and (not necessarily of a specific form) a skew-spectrum can always be constructed 
by similar manipulation. In certain cases might actually also have radial dependence, in which case a line of sight integration needs to be 

performed to match observations. 

^ 1= (2lTT)EE (2t, + ^f + 1) (o M^^^K^^+cyc.pen^K^C^cyc.perm.] (45) 

VI" ij v 7 



5.4 Cross-contamination from Point Sources and Primary non-Gaussianity 

The bispectra associated with point sources is modelled as &f S ; 2 ; = const. The constant depends on the flux limit. More complicated modelling which 
incorporates certain aspects of halo models can be used for better accuracy Serra & Coorav ( 200^). 

cc c - (46) 

Similarly given a model of primary non-Gaussianity one can construct a theoretical model for computation of Bff*™ (see Munshi & Heavens 2009 for 
more about various models and construction of optimal estimators). While study of primary non-Gaussianity is important in its own right for the study 
of secondaries they can confuse the study. 

Typrim psec 

S = E err ^ 

Similar results hold at the level of the skew spectrum. A more general treatment based on Fisher analysis of multiple bispectra is presented in the 
subsequent sections. 

In addition to various sources mentioned above, second-order corrections to the gr avitational potential through gravitational instability too can also 
act as a source of secondary non-Gaussianity dMunshi. Souradeep & Starobin skv 1995). 
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6 OPTIMISED ANALYSIS OF MIXED BISPECTRA 



Starting from lBabichl 1 2005b a complete analysis of bispectrum in the pr esence of partial sky coverage and inhomogeneous noise was developed by 



various authors jBabichl 20051 : ICremineilTe t al. 2006; lYadav et alj|2008l). A specific form f or a bispectrum estimator was intr oduced which is both 



unbias ed and optimal. This was further developed and used by Smith, Zahn & Dore] d2007l) for lensing reconstruction and by ISmith &"Z aldarriaga 
(2006) for general bispectrum analysis, for one-point estimator for /nl- The analysis depend s on finding suitable inverse cosmic variance weighting 
C _1 of modes. It deals with mode-mode coupling in an exact way. In a recent work iMunshi & Heavens! J2009I) further extended this analysis by 
incorporating two-point statistics or the skew spectrum which we have already introduced above. We generalise their results in this work for the case of 
mixed bispectra for the case of both one-point and two-point studies involving three-way correlations. The analytical results presented here are being 
kept as general as possible. However in the next sections we specialise them to individual cases of lensing reconstruction and the mixed bispectrum 
associated with lensing and the SZ effect as concrete examples. 

6.1 One-point Estimator: Mixed Skewness (X(fl)Y({l)Z(n)) 

We are interested in constructing an optimal and unbiased estimator for the estimation of mixed skewness (X (Cl)Y (Cl) Z (Cl)) . The fields X(Q.) = 
Sim Xi m Yi m (£l) and similarly for Y and Z, are defined over the entire sky, though observed with a mask and nonuniform noise coverage. The non- 
uniform coverage imprints a mode-mode coupling [Cxx]^ m i 2Tn2 m the observed multipoles of a given field in the harmonic space (Xi irni Xi 2nl2 ). 
For the construction of the optimal estimator it will be useful to define Xi imi as 

X hmi =[Cxx]T imi ,i ama X latn. a - (48) 

Here X\ m represents the harmonics of the data X with inverse covariance weighting. Next we need to deal with the covariance matrix of the modes 
Jfijmj in terms of that of Xi imi The auto covariance matrix for X, Cxx, and that of X, Cxx are related by the following expression: 

[Cxx]l a m a ,l b m b = (Xl a m a Xl bmb ) = [C xx ]l a , na ,l b ,n b . (49) 

Similarly, the cross-covariance for two different fields X and Y with inverse variance weighting, in harmonic space can be written as: 

Cxy = {Xi a m a Yi bmb ) = C xx CxyC Y y- (50) 

The estimator that we construct will be based on functions Q[X ,Y , Z] which depends on the input fields, and its derivatives w.r.t. the fields e.g. 
dQ[X, Y\/dZi m - The derivatives are themselves a map with harmonics described by the free indices Im, and are constructed out of two other maps. 
The function Q on the other hand is an ordinary number which depends on all three input functions and lacks free indices. 

Q[X,Y,Z] = - ^ B{f,J,F ( ^ m „ ^Xi m Y Vra iZ V t m n (51) 

Iml' m' I" m" ^ 

dQ[X,Y] _ \- vXyz ( I I' I" , v .. 



dZ lm 111 \ m m m 

Iml m I m 



Similar expressions hold for other fields such as Xi m , Y\ m Introducing a more compact notation Xi, where x\ — X, X2 = Y, x$ = Z we can write the 
one-point estimator for the mixed skewness as: 

E[xi] = I |Q[xi]-^[x]t ma (a| ama Q[^])| . (53) 

This is a main result of the paper, generalising work bv lSmith & Zaldarriagal J2006h to mixed fields. 

The ensemble averaging {) in the linear terms represents Monte-Carlo averaging using simulated non-Gaussian maps. The associated Fisher matrix 
(a scalar in this case) can be written in terms of the functions Q[x^], its derivative and the cross-covariance matrices involving different fields. 

Here we have used the shorthand notation for (i] ima xj m ) = [C _1 ]; J m i b m b - ^ n case of joint estimation of different bispectra from the same data 
sets we can extend the above discussion and write: 

E[xt] = F~\ p {Q"[xi] ~ [x]Lm a (dl ma QP{x t })} . (55) 

Here the Fisher matrix F a p encodes the inverse estimator covariance for different mixed bispectra B a and B 13 , a and (3 represents different types of 
bispectra recovered using the same data sets. 



F a p= 2^ B hl2h B l i l 5 l< i 



[C jiimi ,Z 4 m 4 [C ]l 2 m, 2 ,l5m 5 [C ]l 3 m 3 ,l 6 m 6 + 



+ 

+ [C ]l 1 m 1 ,l 5 m 5 [C ]l 2 m 2 ,lem 6 [C ]l 3 m 3 ,l i m i + [C ]l 1 m 1 ,l e me[^' ] h m 3 >'5™5 \P ]h™2,U 



(56) 
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Figure 4. The cross-spectra fe; introduced in Eq. d32t required for the construction of the bispectra is plotted for ISW cross lensing (right panel) and SZ cross lensing (left 
panel) bispectrum as a function of I. See text for details. 

The cyclic permutations here represent two additional terms with permutations of super-scripts X, Y, Z along with associated subscripts. The Fisher 
matrix (which is a number in this particular case) for the mixed bispectrum in case of all-sky coverage and constant variance noise can be expressed as: 



E, _ 1 \ nXYZ nXYZ 



QXX pYYpZZ 



+ 



pXX 



pXXpYY 



+ 2 



C 



c 



c 



ZZpYY i > A A ,->.VX QXXpYY 



Cf z C YY C xx Ct 

J 2 'a *2 t 



(57) 



6.1.1 Special Case (A): Z=Y, (X(D.)Y 2 (Q)) 

In certain practical situations we will encounter cases where two of the three fields are identical. The corresponding Fisher matrix can be recovered by 
simply setting Z = Y . 



6 t-~> 



r,r>XYY n XYY 

^Ixl^H^lxliH \ -£xx 



C YY 

>2 



,ORXYY dXYY 



r x\ 



C XX C YY 

h h 



c, 1 

f'2 



(58) 



6.1.2 Special Case (B): Z=Y=X, (X 3 (Q)) 

Finally, if we identify all three fields to recover the case ordinary or pure bispectrum corresponding to the case X — Y — Z. 

Faff — g ^ ~i ^hhl3^UlF,ln ]iim 1 ,i 4 m 4 [C ] l 2 m 2 ,(5 m 5 [C ]l 3 m 3 ,l 6 m e - 

In the limit of all-sky coverage and constant variance noise the estimator reduces to: 

t>XYZ r>XYZ 



1 R^ 1 R^ 1 ' 

aP 6 2_, c h c h c h 



(59) 



(60) 



— 1/2 

For high Is a scaling f sky ' is sufficient to describe the effect of partial sky coverage on the error covariance matrix. 
6.2 Two-point Estimators: Mixed skew spectrum 



We begin by constructing the functions Ql and its derivative w.r.t. various input fields. We use these to construct an optimal and unbiased estimator to 
correlate the field X(fl) with the product of two such fields Y(Cl)Z(Cl). We consider the most general possible case of the skew spectrum associated 
with the mixed bispectrum B XYZ . 



[X,Y, Z] = ^Ilm ^2 B u'i" i m l m , l m i, ] i',-,„-Z,",„" 



df m QL[Y,Z]=8u 



EBu'l" [ „ 1 11 ) Yl'm' Zl"m" 

\ m m m 



dT m Q L [X, Z]=J2 Xlm b 



Lir \ M m m i ) Zp m r, df m Q L [X,Y] ' 



Xlm *S B 



L I I , 

M m m 



(61) 
(62) 
(63) 
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While Ql is a number (cubic function of input maps) for a given L, the derivatives are maps which are quadratic in the input maps. The derivatives will 
be important in constructing the linear terms which are important in reducing the variance of the estimator in an absence of spherical symmetry, which 
is the case in the presence of inhomogeneous noise or partial sky coverage. Using these expressions we can write down the optimised bispectra 

E^ YZ [x i ] = [N- 1 }^ z \Q L ,[xi}- ^ [xi]im(dLQ L '[xi])Mc) > ■ (64) 

I »=1,2,3 J 

The normalisation matrix N LL , is related to the Fisher matrix F LL , = N~^,, and can be expressed as: 

N iv Z = \ ({di^Qiix}} [C^l^f {di 2m2 Q L '[x]}) - l{(di imi Q L [x})} [Cf^i^f {(^U^'K)} • (65) 
Finally the Fisher matrix can be written as: 

yXYZ r>XYZ 

L,1 'i i,|2, U M mi m' 1 ) { M' m 2 m' 2 



Z7 \ ^ \ ^ nXYZ nXYZ ( L l\ l[ \ ( L' h l'l 

b LL'= 2^ 1^ B Lhl' 1 H L'l 



MM' lil'.mi-m'. 



X q{[C XX ]lM,L'M' [C YY ]l 1 m 1 ,l' 1 m' 1 [C* Z ]l 2 m 2 ,l' 2 m' 2 + [C^ ]lM [C Y Z ]l imi ,L> M' [C^ '* '}l 2 m 2 ,l' 2 m' 2 + 
[C XZ }LM,l' 2 m' 2 [C ,Zy ]i 2 m 2 ,!' 1 m' 1 [C* ' * ]l imi ,L' M' + [C XX ]lM,L'M' [C ,Zy ]i 2 m 2 ,i' 1 m' 1 [C^ }lM ,l' 2 m' 2 + 

[C liimi^m'JC ]LM,l' 2 m' 2 [C ]l 2 m 2 ,L> M ' + [C \l 2 m 2 .l^m'JC }LM,l' 1 m' 1 [C ]l imi ,L'M> }. (66) 

In the case of near all-sky experiments the off-diagonal elements of the Fisher matrix will be relatively smaller. The diagonal elements as before can be 
scaled by f sky (the fraction of the sky covered by a near all-sky experiment). The covariance matrices can now be expressed only as a function of the 
related C;s the auto- and cross-correlation power spectra: 



Fr.rj = 5 
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}■ (67) 



The other two terms represented by cyc.perm. consist of terms with suitable permutations of superscripts X,Y and Z. In deriving these expressions 
all-sky limits of Cf^ Vm , = (l/C xx )5 w S mrn i were used along with the fact that we can write Cf Y Vm , = {C XY /C xx Cf Y }S W 5 mm , for all sky 
case. For the case when the cross-correlation among two or more fields vanish the expression simplifies considerably. 
In case of joint estimation of several bispectra from the same data one can write the following expression: 

£2N =EE'^ 1 '"' I - E l^UidimQU^Mc) 1 . (68) 

L> (3 \ i=l,2,3 J 

Here the indices a and f3 correspond to various power spectra [B x Y 2 f 3 ] a or \B x Y 2 f s \ which are associated with bispectra that can be jointly estimated 
from the same data [xi]. Below we consider two special cases for the skew spectra that we have considered so far. The expressions for F"^, and N^, 
can be obtained simply by replacing the product B x ^ z B x ^ z by [B XY f 3 ] a \B X -J^^t ■ m certain situation when accurate noise modelling is difficult 

or unlikely an approximate proxy for C _1 is used in the form of a regularization matrix R which acts as a smoothing of the data. The resulting data 
vector if = Rxi is now used for developing a unbiased but suboptimal estimator by replacing Xi with if. 



6.2.1 Special Case (A): Z=Y 

The estimator in this case corresponds to E X ' Y 



= 2<5 ll' ■! B XYY B XYY „ vv „ vv ^ YY | + 1 2 B XYY B XY 



/ j I "L-W LiW pXX nYY nYY f 1 / , I "LL'l ^ L' I pXXnYY ] \ pXXpYY ] pYY 



IV 

XYY j->XYY 1 / \ I M \ , r,XYY r,XYY 



'LL'l D ILL' pYY I qXXqYY I I QXXpYY j ' C 'LIL' ^LUL pYY 1 qXXqYY j \ Q 



^c YY )) 



(69) 
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Figure 5. The cross-spectra 6; introduced in Eq. )32t required for the construction of the bispectra is plotted for point source(PS) cross lensing (right panel), ISW cross 
lensing (middle panel) and SZ cross lensing (left panel) bispectrum as a function of I. See text for details. 



6.2.2 Special Case (B ): Z= Y=X 
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(70) 



6.2.3 Special Case (C): Z=X 

The estimator we consider here is Ef~' XY can not derived by simple identification of superscript. We define the new functions related to the estimators 
Ql, dhnQh and dY m Qh according to the same prescription above. 
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The estimator in this case takes the form: 
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I ^l'm'Yl"r. 



(71) 
(72) 
(73) 

(74) 



LL ~ U LL 



Ef tjXYX nXYX 1 1 1 \ , f nXYX R 

<y a LW £>LW QXXQXXQXXj + \^LU' & 



XYX 1 

UV T^xx 



C xx \C XX CJ 



c xx q, 



, V / n XYX u XYX ( C * Y i ( C l' Y i 

+2^{B LLn b l , u ^ c xx c y l y J \ci, x cir ) 



, r,XYX r,XYX 1 
+£>LL'l £>L'IL 



rxx 



qYX 

WW 



nXXpYY 
qXY 



r>XYX ryXYX 
E>LIL' tJh'll 



fXX 

, U XYX n XYX 1 1 1 1 

+ &LIL' a L'lL ^XXQXXQYYj 



}) 



(75) 



(76) 



7 SPECIFIC EXAMPLES 

The discussion so far has been completely general. We specialise now for a few practical cases of cosmological importance. These correspond to the 
study of mixed bispectra associated with lensing induced correlation of secondaries and CMB as well as frequency cleaned SZ catalogs against CMB 
sky. 
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7.1 Lensing Reconstruction 

7.1.1 One-point estimator: 

Various es timators associated with len sing reconstruction were introduced by different authors, e.g. dHull2000l : iHu & Okamotol koCg)). It was recently 
studied bv lSmith. Zahn & Dord fco07t) and was used to probe effect of lensing in CMB by cross-correlating with external data-set such as NVSS survey 
against WMAP observations. 

Slm3 = 2^ ^ B S'3 [Vl^™!*3™3 - [C^T^rm^^hms] ■ ( 77 ) 

This is achieved by writing th e reconstructed lensing pot ential in terms of the CMB harmonics and cross-correlating it with low-redshift large-scale 
tracers such as galaxy surveys ( Smit h. Zahn & Doref2 007). The bispectrum B l ^ depends in addition to the Cis of the CMB multipole, on the cross- 
correlation between the CMB sky 8(0.) and the low-redshift tracer field if>(fi). The reduced bispectrum of interest &f*j^j 3 and the related form factor 
/z 1 ! 2 i 3 can be written as: 

°hhl 3 ~ yhhh L h + Jhlih c h j c i 3 

fhhh = + 1) + Hh + 1) - + 1)} (78) 

The multipole 8i imi and Si 2Tn2 are associated with the CMB sky and ipi 3 m 3 is the multipole associated with the large-scale structure tracer at low 
redshift and hence correlates with the lensing potential (e.g. NVSS survey). The above estimator directly probes the cross-correlation between the 
lensing potential harmonics (f>i m constructed from temperature harmonics 8i m and the harmonics of the tracers ipi m . It is interesting to notice that the 
estimator constructed lacks the term which signifies the correlation between 8(Cl) and through the coupling C" 5 *. Though the bispectrum itself 
depends directly on the cross-power spectra. Using the results derived before we can write the Fisher matrix associated with this estimator can be 
written as: 

F = F' = -V B? 6 ?, Bf s * [C^ir 1 , [C"]," 1 , [C^l, -1 , • (79) 

2 / j (1*2*3 (4'5'6 L Jiim 1 ^4m 4 L H 2 m 2^^ m ^ v J t3 m 3>'6 m 6 



7.1.2 Estimators for the skew spectrum 

If instead of the one-point estimator described above, we compute the two-point estimator or the skew spectrum as follows: 



E L 



[8, i>] = [N- 1 ]^, {Q L , [8, i>] - ^{df m Q L , [8, i>]) } . (80) 



The corresponding expressions for the functions Qu [<j>] and dimQu \<j>, V 1 ] are given by: 

QL = y^i>LM 8i m 8i> m > ( ' i,/ I ; 9 f m Q L = y^iptii's^ B s j^f,<j>ii m i ( ^ 1 1 , \. (si) 

Z—d \ m m m I ^-^ ' L " \ m m m I 

M lm,lm V ' LM I'm' V ' 

Corresponding Fisher matrices can which turns out to be diagonal can be written as: 

F LL ' = N£l = {<^ m Q i [C**] Im> „ m /S* m/ Q i ,> - (dtmQL)\C*%m,l>m'{dtm'Qv)} • (« 2 ) 

which finally leads us to: 

P — _ \ ^ R*' 5 "/' r>86i> r^^^i -1 \ ( ~i<t>4>]~ 1 r^V"/"] -1 ra« 

_ 2 ' / l ~< hL ItUL' IV Jiimi,i 3 m3 IV J l 2 m 2 ,141714 IV \LM,L'M'- k ° d > 
IV 

In the limit of all sky survey and homogeneous noise we can write: 

F LL , = 8 LL , Y, Bf^BfZ (J^J {J^J (J^j (84) 

A comparison with the previous estimator shows the presence of off-diagonal entries in the Fisher matrix even if direct correlation between 8 and ip are 
absent in the estimator. 

It is possible to work with CMB sky without external data sets (such as NVSS or other galaxy surveys) to probe weak lensing e.g. the power 
spectrum of the lensing potential itself is related to four-point statistics of the temperature - which makes it noise dominated, use of external tracers 
such as galaxy surveys can reduce the problem to three-point level thus lowering the need on s ensitivity of the instrument . The discussion above can 
have direct relevance for use of other tracers such as the one with neutral hydrogen observations jZahn& Zaldarriaga 2006b. 
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Figure 6. The cross-spectra 6; introduced in Eq. d32t required for the construction of the bispectra is plotted for ISW cross lensing (right panel) and SZ cross lensing (left 
panel) bispectrum as a function of I. See text for details. 

7.2 Sunyaev-Zeldovich-CMB 2 mixed bispectrum 

The secondary bispectrum caused by the Sunyaev-Zeldovich effect is one of the most pro nounced sec ondary bispectrum among ma ny others 
dGoldberg & Spergellll999l ; ISpergel & Goldbergf 19991 ; ICoorav & Hul200rj ; |Coora^l200d . l2001 alfbh . Following lCoorav" Hu& Tegmarkl (;2000) we study 
if frequency cleaned maps of all-sky CMB and SZ maps can also be used to construct power-spectra associated with the mixed bispectra with signal-to- 
noise ratio that can be detectable with ongoing CMB experiments. It probes mode-coupling effects generated by correlation involved with gravitational 
lensing angular deflections in CMB and the SZ effects due to large-scale pressure fluctuations. As before the estimator which can be constructed from 
the CMB ai m and the Sunyaev-Zeldovich si m multipoles. There is a possibility of constructing the correlating the product map a(Q)s(Cl) with a(Cl) 
as well as a(Cl) and s(Cl) 2 . In terms of the suboptimal estimators introduced before the correspond to C^ s ' a and Gf' a respectively. In the second case 
analysis is exactly same as that of lensing reconstruction discussed before. However in the first case the optimal estimator is expressed as follows: 

E s £ aa [a, s] = [iV _1 ]Lz/ {Q L >[a.,5] - ai m (d? m Q L >[a,s]} - si m (dt m Q L i[a, s]}} . (85) 

The above estimator considering is same as Eq.J68t: with the corresponding Q L i [a, s] function and its derivatives are given in Eq.<l63t. 

The mixed bispectrum of CMB 2 — SZ is known to be exactly same as that of bispectrum we considered in the lensing reconstruction. This is true not 

only for SZ-lensing bispectrum but for other lensing-induced correlation-related bispectra too. The only difference is in different b;s involved. 




This is an application of the case considered in Eq. ( 1691 ). For the one-point mixed skewness associated with this power spectra the related Fisher error 
is simply given by the sum over all the elements. F = ^2 L L , F LL i . The cross correlations C" a between the two maps s(fi) and a(Cl) introduces the 
off-diagonal elements in the Fisher matrix even for the case of all-sky coverage and homogeneous noise. Ignoring the correlations we can recover the 
Fisher matrix elements derived for the case of lensing reconstruction. 

In addition to considering the cross-correlation of s(Cl) and a 2 (Cl) as discussed above, the other estimator of non-Gaussianity that we can consider 
is by considering the cross-correlation of product field s(Cl)a(Cl) and a(Cl) which is same as the estimator is same as that defined in Eq. l |74t with the 
relevant Q term and its derivative given by Eq. d73b - The associated Fisher matrix is given by Eq. J76t . The one-point estimator recovered from both of 
these degenerate estimators will be the same. 



8 CONCLUSIONS 

Extending previous work for estimation of power-spectra from correlated data sets we show how pseudo-C; -based approaches (PCL) can be used 
for estimation of cross-correlation power spectra from multiple cosmological surveys through a joint analysis. Analytical results were derived under 
very general conditions using an arbitrary mask as well as arbitrary noise properties. We also keep the weighting of the data completely general. Our 
analytical results also include a systematic analysis of covariance of various deconvolved Cis characterizing auto- and cross-power spectra from a joint 
analysis. While PCL-based approaches are known to be unbiased they are not in general optimal. However they can be made to act in a near-optimal 
way by the introduction of weights in different regimes corresponding to signal or noise domination. These studies will be useful in analyzing simulated 
as well as real survey data either in projection or in 3D. We specialise these expressions to recover well-known f s t y approximation used in the literature 
for the error analysis. Using a halo model inspired approach we compute the expected cross-correlation signal in cross-correlating NVSS type survey 
with the CMB sky through the ISW effect. We also study the cross-correlation between the frequency-cleaned SZ surveys against the ISW effect. The 
cross-correlation study also provides the covariances among different estimated C;s and the signal-to-noise of detection for a specific survey. However 
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we want to stress that the formalism developed here is more powerful and can tackle many issues in analysing realistic surveys. A detailed study using 
simulations will be presented elsewhere. 

The analysis of the bispectrum is one step beyond the power-spectrum and provides additional cosmological information. The primary motivation 
to date has been to put constraints on early-universe scenarios, however secondary bispectra can play a significant role in enhancing our understanding 
of large-scale structure formation scenarios. The secondary bispectrum is mainly related to mode-coupling by seconda ry effects and lensing. We study 
various statistics which can directly handle realistic data sets. Extending previous work bv lMunshi & Heavens! d2009l) . we take into account multiple 
correlated fields which are used for constructing a mixed skewness at one-point level as well as constructing a skew spectrum at the level of two- 
point. A very general framework was developed for the study of bispectrum from correlated fields in an unbiased and optimized way. We introduce the 
inverse covariance weighting and specialize our results for the analysis of bispectrum originating from lensing-secondary correlations. A simple-minded 
approach which handles the noise and partial sky coverage in a nearly optimal way using Monte Carlo techniques is also discussed. We also develop an 
approach based on PCL to study the skew spectrum. This approach, whilst suboptimal, can handle the noise and partial sky coverage directly. It is also 
possible to use weights to make it near optimal in the limit of high I, and can be useful mainly because of its speed of handling MC realisations. In its 
most general form, the estimator El (equation d64t for the skew spectrum of mixed fields includes the effect of partial sky coverage and inhomogeneous 
noise, and provides a compact function which can be compared with theoretical models to identify the source of the correlations between the fields. 
The associated Fisher matrix (equation [66} allows statistical analysis of the the El estimates, allowing the estimation of the relative contributions from 
different physical processes. 

For specific examples we have focussed on probing the secondary non-Gaussianity with Planck type all-sky experiments and surveys such as 
NVSS. The signal to noise ratios for cross-correlation studies involving lensing potential and secondaries such as SZ and ISW would allow detection 
with Planck. However to differentiate among various effects one would need to go beyond cumulative signal-to-noise estimates and the statistics which 
we introduced here will be useful diagnostic tools. 

There has been lot of work by a number of authors to detect correlations between the WMAP CMB and large scale structures, which typically 
conclude with a constraint on the dark energy (accelerating universe). Analysis of secondary bispectrum has also been attempted. However, at this 
point, consistent simulations which can correctly take into account, the correlation between CMB and the LSS, and the impact of the LSS on the various 
observables is still remains to be developed. Though a patchwork of simulations are getting ready, we still lack suitable simulations which can be used 
both for cross-correlational analysis or the entire range of bispectrum analysis at the moment. Our approach can be invaluable in quantifying accuracy 
of such consistency check and eventually to put constrain on cosmology using real high resolution data. We have not taken into account the errors or 
residuals from foreground removals. Some of the foreground contaminations may well be correlated to various LSS tracers. These issues and how PCL 
based approach can tackle them will be dealt with elsewhere. 
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